Abstract

This paper is intended to be a pedagogical introduction to quantum Bayesian networks 
(QB nets), as I personally use them to represent mixed states (i.e., density matrices, 
and open quantum systems). A special effort is made to make contact with notions 
used in textbooks on quantum Shannon Information Theory (quantum SIT), such as 
the one by Mark Wilde (arXiv:1106.1445).

1 Introduction



This paper is intended to be a pedagogical introduction to quantum Bayesian networks 
(QB nets), as I personally use them to represent mixed states (i.e., density matrices, 
and open quantum systems). A special effort is made to make contact with notions 
used in textbooks on quantum Shannon Information Theory (quantum SIT), such as 
the one by Mark Wilde [1].

QB nets are a generalization of classical Bayesian networks (CB nets) to quantum
mechanics. CB nets have been a hot topic in AI circles since the seminal work of
Judea Pearl and collaborators that started in the 1980s. A very complete book on
CB nets is the one by Koller and Friedman [2].

Just like mankind has devised many names for the idea of God, there are 
other names for CB nets and variations of the idea. Others have called them causal 
probabilistic diagrams, factor graphs, probabilistic system diagrams, etc. To be sure, 
there are some differences between some of these diagrams and CB nets, but all seem 
to be striving to conjure up the same divine concept. Some of the close siblings of 
CB nets are discussed in Ref. [3], an IEEE magazine article by Loeliger. 

One variation on the CB net idea involves using graphs in which the arrows 
(a.k.a. directed edges) represent tensor indices and the boxes (a.k.a. nodes, vertices) 
represent transition matrices. In this approach, call it the tensor-graphs approach, 
each arrow coming out of a fixed node carries different stuff. In the CB nets approach, 
the nodes again represent transition matrices, but the arrows perform a very different 
job. Each arrow coming out of a fixed node carries the same stuff, namely the name 
of the node the arrow originates at. This difference might appear subtle or even 
insignificant to the untrained eye, but Bayesian network believers (like me) swear 
by it, claiming that it is clearer and more powerful than the tensor-graphs approach
when dealing with probabilities. Bayesian network believers think that using tensor 
graphs to describe probability networks is like trying to fill round holes with square 
pegs. Even when the pegs fit, they don't do a very good job. 

Classical information theorists have been using tensor-graphs in their field for 
a long time. See, for example, the chapter on "network information theory" in the 
book by Cover and Thomas, Ref. [4]. Or look at the book by El Gamal and Kim,
Ref. [5], which is devoted exclusively to the subject of network information theory.

Quantum information theorists have been using tensor-graphs in their field for 
a long time too, at least since the seminal work by Schumacher and collaborators. For 
an early example of a paper by Schumacher that uses tensor-graphs, see, for example, 
Ref. [6], published in 1996.

Some early quantum information papers, for example the seminal paper Ref. [7] 
by Bennett et al. also use a version of tensor-graphs, but they use them in a very 
loose, ambiguous, imprecise way. Ref. [7] even has some diagrams that sound like 
sacrilege to the ears of this Bayesian nets believer, such as diagrams that have nodes 
intended to represent buckets of sewage (Fig. 13 in Ref. [7]).

Adherents of "the QB net way" not only espouse being good parents by
treating all arrows (= children) coming out of a fixed node (= parent) the same;
we are also very strict about our nodes. Each node has a numerical "value", and the
whole graph also has a value which equals the product of the node values. Nodes
aren't there just for decoration, or as mere labels with no numerical value assigned
to them, or only to convey an abstract notion like "do this operation now".

Many quantum information papers don't use diagrams at all. They specify
their quantum "protocols" or algorithms in terms of pseudo-code. In my opinion,
those papers would be much clearer if they described their algorithms using both
pseudo-code and QB nets, whenever this is possible. The recent book on quantum
information theory by Mark Wilde [1] earns high marks in this regard, as it uses a lot
of diagrams. Wilde's diagrams also have a fairly precise meaning. However, they are
tensor-graphs, not the wonderful QB nets.

My own work on QB nets started about 15 years ago with Ref. [8]. In that 
early paper, I dealt only with QB nets for pure quantum states. I've been using QB 
nets for mixed states since at least Ref. [9]. This paper repeats some of the ideas of 
Ref. [9] for QB nets of mixed states, with (hopefully) some small improvements. 

I've also written a Mac application that does QB nets called Quantum Fog.^1

I also have a blog called Quantum Bayesian Networks (Ref. [11]) in which I
regularly post articles about Bayesian networks and quantum computing. 

Subsequent to Refs. [8], [9], other workers have devised their own types of diagrams
for doing quantum information theory. Their diagrams are very different from
the QB nets in this paper.

(a) Leifer and Poulin in Ref. [12], and later Leifer and Spekkens in Ref. [13], postulate
some directed acyclic graphs, but they assign a whole density matrix to each
node of the graph. Furthermore, their node density matrices would be quite 
hard to calculate in practice, especially for complicated graphs. In comparison, a 
whole QB net is used to describe one density matrix. And the transition matrix 
that a QB net assigns to each node comes from the definition of a probability 
amplitude, a really basic thing that requires almost no calculation — certainly 
less calculation than the node density matrices of Leifer and coworkers. 

I like to think of QB nets as: A light container of data, useful as a data structure 
in computer programming. A vehicle rather than a destination. A transpar- 
ent, tidy way of organizing a lot of data in a pictorial way, prior to intensive 
calculation, not as being itself the outcome of a major calculation. 

^1 I stopped Quantum Fog development in 2006. The application is still available for free at
Ref. [10]. Quantum Fog's last version (Version 2.0) is known to work with Mac OS X < 10.4. It
probably works with some higher versions of Mac OS X too. Quantum Fog only does QB nets for 
pure states. That's not because QB nets can't deal with mixed states as some people think. It's 
only because I stopped developing Quantum Fog before I had a chance to add to it the capability 
to do mixed state calculations.

(b) Coecke (Ref. [14]) and collaborators use category theory to define their diagrams.
In comparison, QB nets are much less abstract. Defining them requires no
category theory, just standard, run-of-the-mill quantum mechanics.

The QB nets in this paper are much simpler than the diagrams of (a) and (b). 
Simplicity can be a virtue in mathematics (consider, for example, abstract algebra's 
definition of a group, which is simplicity itself). QB nets are, however, complicated 
enough to be very expressive and useful; that is, they allow one to express numerous 
quantum mechanical concepts in a useful, practical and enlightening way. 

QB nets are a very parsimonious extension of CB nets to quantum mechanics. 
That is, the definition of QB nets is the smallest possible modification of the definition 
of CB nets that I can come up with, but enough of a modification so that one can 
do proper quantum mechanics with them. Keeping QB nets close to CB nets can 
be very fruitful, because much is already known about CB nets. And CB nets use 
classical probability so we can sharpen our classical understanding of a problem with 
them and then try to extrapolate that understanding to QB nets. QB nets retain 
the same structure as CB nets and can be reduced to them very easily, simply by 
applying the dephasing operator "cl" (defined below) to each node. Thus, QB nets 
make the connection to the classical case very direct and explicit. 

2 Basic Notation 

As usual, $\mathbb{Z}$, $\mathbb{R}$, $\mathbb{C}$ will denote the integers, real numbers, and complex numbers,
respectively. For $a, b \in \mathbb{Z}$ such that $a \le b$, let $Z_{a,b} = \{a, a+1, a+2, \ldots, b\}$.

Let $\delta_x^y = \delta(x, y)$ denote the Kronecker delta function; it equals 1 if $x = y$ and
0 if $x \neq y$.

For any matrix $M \in \mathbb{C}^{p \times q}$, $M^*$ will denote its complex conjugate, $M^T$ its
transpose, and $M^\dagger = M^{*T}$ its Hermitian conjugate.

Random variables^2 will be denoted by underlined letters; e.g., $\underline{a}$. The (finite)
set of values (states) that $\underline{a}$ can assume will be denoted by $S_{\underline{a}}$. Let $N_{\underline{a}} = |S_{\underline{a}}|$.

The probability that $\underline{a} = a$ will be denoted by $P(\underline{a} = a)$ or $P_{\underline{a}}(a)$, or simply
by $P(a)$ if the latter will not lead to confusion in the context it is being used. We
will use $pd(S_{\underline{a}})$ to denote the set of all probability distributions with domain $S_{\underline{a}}$.

In quantum physics, $\underline{a}$ has a fixed, orthonormal basis $\{|a\rangle_{\underline{a}} : a \in S_{\underline{a}}\}$ associated
with it. The vector space spanned by this basis will be denoted by $\mathcal{H}_{\underline{a}}$. Other spans
of $\mathcal{H}_{\underline{a}}$ that are not necessarily orthonormal will be denoted by Greek letters with
^2 We will use the term "random variables" in both classical and quantum physics. Normally,
random variables are defined only in classical physics, where they are defined to be functions from
an outcome space to a range of values. For technical simplicity, here we define a random variable
$\underline{a}$, in both classical and quantum physics, to be merely the label of a node in a graph, or an n-tuple
of such labels. Each node or random variable of a CB or QB net is akin to a spacetime event or
a collection of them.

subscripts as in $\{|\psi_j\rangle_{\underline{a}}\}_{\forall j}$. In quantum physics, instead of probabilities $P(\underline{a} = a)$, we
use "probability amplitudes" (or just "amplitudes" for short) $A(\underline{a} = a)$ (also denoted
by $A_{\underline{a}}(a)$ or $A(a)$). In place of $P(a) \ge 0$ and $\sum_a P(a) = 1$, one has $\sum_a |A(a)|^2 = 1$.

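Besides probability amplitudes, we also use density matrices. A density matrix $\rho_{\underline{a}}$ is a
Hermitian, non-negative, unit trace, square matrix (or the associated linear operator)
acting on $\mathcal{H}_{\underline{a}}$. We will use $dm(\mathcal{H}_{\underline{a}})$ to denote the set of all density matrices acting on
$\mathcal{H}_{\underline{a}}$.

If $\rho_{\underline{x},\underline{a}} \in dm(\mathcal{H}_{\underline{x},\underline{a}})$, and $\rho_{\underline{x}} = \mathrm{tr}_{\underline{a}}(\rho_{\underline{x},\underline{a}}) = \sum_a \langle a|_{\underline{a}}\, \rho_{\underline{x},\underline{a}}\, |a\rangle_{\underline{a}} \in dm(\mathcal{H}_{\underline{x}})$, we will say
that $\rho_{\underline{x}}$ is a partial trace of $\rho_{\underline{x},\underline{a}}$. Given a density matrix $\rho_{\underline{x}_1,\underline{x}_2,\underline{x}_3,\ldots} \in dm(\mathcal{H}_{\underline{x}_1,\underline{x}_2,\underline{x}_3,\ldots})$,
its partial traces will be denoted by omitting its subscripts for the random variables
that have been traced over. For example, $\rho_{\underline{x}_2} = \mathrm{tr}_{\underline{x}_1,\underline{x}_3}\, \rho_{\underline{x}_1,\underline{x}_2,\underline{x}_3}$.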

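Sometimes, when two random variables $\underline{a}_{(1)}$ and $\underline{a}_{(2)}$ satisfy $S_{\underline{a}_{(1)}} = S_{\underline{a}_{(2)}}$,
we will omit the indices (1) and (2) and refer to both random variables as $\underline{a}$. We
shall do this sometimes even if the random variables $\underline{a}_{(1)}$ and $\underline{a}_{(2)}$ are not identically
distributed! This notation, if used with caution, does not lead to confusion and does
avoid a lot of index clutter.

When we want to make explicit that an operator $\Omega$ maps states in $\mathcal{H}_{\underline{a}}$ to states
in $\mathcal{H}_{\underline{b}}$, we will indicate this with a subscript (or superscript) as $\Omega_{\underline{b} \leftarrow \underline{a}}$ or as $\Omega_{\underline{b}|\underline{a}}$. In
cases where $S_{\underline{b}} = S_{\underline{a}}$, we will sometimes write $\Omega_{\underline{a}}$ instead of the clearer but longer
$\Omega_{\underline{a} \leftarrow \underline{a}}$ or $\Omega_{\underline{a}|\underline{a}}$.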

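The tensor product symbol $\otimes$ will often be omitted. Sometimes, when two
vectors are being tensored, we will list the two vectors vertically instead of horizontally
(the latter is more common in the literature). For example, we might write

$$|\phi\rangle_{\underline{a}}\, |\psi\rangle_{\underline{b}} = \begin{array}{c} |\phi\rangle_{\underline{a}} \\ |\psi\rangle_{\underline{b}} \end{array} \qquad (1)$$

This doesn't lead to confusion as long as we indicate what vector space each vector
lives in. (In the above example, $|\phi\rangle_{\underline{a}}$ clearly lives in $\mathcal{H}_{\underline{a}}$ and $|\psi\rangle_{\underline{b}}$ in $\mathcal{H}_{\underline{b}}$.)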

In this paper, we consider networks (graphs) with $N$ nodes. Each node is
labeled by a random variable $\underline{x}_j$, where $j \in Z_{1,N}$. For any $J \subset Z_{1,N}$, the ordered set
of random variables $\underline{x}_j$ $\forall j \in J$ (ordered so that the integer indices $j$ increase from
left to right) will be denoted by $\underline{x}_J$. For example, $\underline{x}_{\{2,4\}} = (\underline{x}_2, \underline{x}_4)$. We will often
call the values that $\underline{x}_J$ can assume $x_J$. For example, $x_{\{2,4\}} = (x_2, x_4)$. We will often
abbreviate $\underline{x}_{Z_{1,N}}$ by just $\underline{x}_{\cdot}$. We will often call the values that $\underline{x}_{\cdot}$ can assume $x_{\cdot}$.



3 The Sandbox and its Dual 

For any expressions $\Omega(x)$ and $\rho$ for which this makes sense, we will use the shorthand
notation:

$$\left[\, \Omega(x) \,\right] \rho \left[ \begin{array}{c} \text{h.c.} \\ x \to x' \end{array} \right] = \left[\, \Omega(x) \,\right] \rho \left[\, \Omega^\dagger(x') \,\right] \qquad (2)$$

Here "h.c." is an abbreviation of "hermitian conjugate". We will usually use this
notation with $\rho = 1$. This notation is especially useful when $\Omega(x)$ is a long expression
and we want to avoid writing it twice. We will refer to the space inside the set of
square brackets to the left (resp., right) of $\rho$ as the sandbox (resp., its dual or
mirror sandbox).



4 The Meta State 



QB nets for pure quantum states were first defined in Ref. [8]. A QB net for a pure
state consists of a directed acyclic graph (DAG) and a transition matrix (a complex 
matrix) assigned to each node of the graph. The transition matrices must satisfy 
certain requirements. An example of such a pure state QB net is: 



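[QB net diagram: nodes $\underline{a}$, $\underline{b}$, $\underline{c}$, with arrows $\underline{c} \to \underline{b}$, $\underline{c} \to \underline{a}$ and $\underline{b} \to \underline{a}$] (3)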



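If $a \in S_{\underline{a}}$, $b \in S_{\underline{b}}$, $c \in S_{\underline{c}}$, $A_{\underline{a}|\underline{b},\underline{c}}(a|b,c)$ is the transition matrix associated with node
$\underline{a}$, $A_{\underline{b}|\underline{c}}(b|c)$ is the transition matrix for node $\underline{b}$, and $A_{\underline{c}}(c)$ is the transition matrix for
node $\underline{c}$. We must have

$$\sum_a |A_{\underline{a}|\underline{b},\underline{c}}(a|b,c)|^2 = 1 , \qquad (4a)$$

$$\sum_b |A_{\underline{b}|\underline{c}}(b|c)|^2 = 1 , \qquad (4b)$$

$$\sum_c |A_{\underline{c}}(c)|^2 = 1 . \qquad (4c)$$

Define the total probability amplitude $A_{\underline{a},\underline{b},\underline{c}}(a,b,c)$ by

$$A_{\underline{a},\underline{b},\underline{c}}(a,b,c) = A_{\underline{a}|\underline{b},\underline{c}}(a|b,c)\, A_{\underline{b}|\underline{c}}(b|c)\, A_{\underline{c}}(c) . \qquad (5)$$

Note that Eqs. (4) imply

$$\sum_{a,b,c} |A_{\underline{a},\underline{b},\underline{c}}(a,b,c)|^2 = 1 . \qquad (6)$$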
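As a quick numerical illustration of Eqs. (4) to (6), the following sketch builds random transition matrices with normalized column vectors for the three-node net above and checks that the total amplitude is normalized. The function names (`random_column`, `build_net`, `total_amplitude`, `total_norm`) and the dictionary layout are assumptions made here for illustration only; they are not taken from Quantum Fog or any other software.

```python
import math
import random


def random_column(n, rng):
    """Return n complex amplitudes whose squared magnitudes sum to 1."""
    raw = [complex(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(n)]
    norm = math.sqrt(sum(abs(z) ** 2 for z in raw))
    return [z / norm for z in raw]


def build_net(n_a, n_b, n_c, seed=0):
    """Random transition matrices for the net with arrows c->b, c->a, b->a."""
    rng = random.Random(seed)
    A_c = random_column(n_c, rng)                           # A_c[c]          = A(c)
    A_b = {c: random_column(n_b, rng) for c in range(n_c)}  # A_b[c][b]       = A(b|c)
    A_a = {(b, c): random_column(n_a, rng)                  # A_a[(b, c)][a]  = A(a|b,c)
           for b in range(n_b) for c in range(n_c)}
    return A_a, A_b, A_c


def total_amplitude(net, a, b, c):
    """Eq. (5): the total amplitude is the product of the node amplitudes."""
    A_a, A_b, A_c = net
    return A_a[(b, c)][a] * A_b[c][b] * A_c[c]


def total_norm(net, n_a, n_b, n_c):
    """Left-hand side of Eq. (6)."""
    return sum(abs(total_amplitude(net, a, b, c)) ** 2
               for a in range(n_a) for b in range(n_b) for c in range(n_c))


N_A, N_B, N_C = 2, 3, 2
net = build_net(N_A, N_B, N_C)
norm = total_norm(net, N_A, N_B, N_C)
print(norm)  # should agree with 1 up to rounding error
```

The check works for any state-space sizes because each column vector is normalized separately, which is all that Eqs. (4) require.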



Henceforth, we will sometimes omit the node subscripts from the probability
amplitudes. For example, we might use $A(a|b,c)$ instead of $A_{\underline{a}|\underline{b},\underline{c}}(a|b,c)$, if no confusion
will arise. This is analogous to probability theory, where we often use $P(a|b,c)$
instead of $P_{\underline{a}|\underline{b},\underline{c}}(a|b,c)$ or $P(\underline{a} = a|\underline{b} = b, \underline{c} = c)$ for a probability.

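More generally, suppose the graph has nodes $\underline{x}_1, \underline{x}_2, \ldots, \underline{x}_N$. For $j \in Z_{1,N}$,
a node $\underline{x}_j$ with possible states $x_j \in S_{\underline{x}_j}$ and with parent nodes $\underline{x}_{pa(\underline{x}_j)}$, where $pa(\underline{x}_j) \subset Z_{1,N}$,
has a transition matrix $A(x_j|x_{pa(\underline{x}_j)})$ which satisfies

$$\sum_{x_j} |A(x_j|x_{pa(\underline{x}_j)})|^2 = 1 \qquad (7)$$

for all $x_{pa(\underline{x}_j)} \in S_{\underline{x}_{pa(\underline{x}_j)}}$. Let $x_{\cdot} = (x_1, x_2, \ldots, x_N)$. If the total amplitude $A(x_{\cdot})$ is
defined by

$$A(x_{\cdot}) = \prod_j A(x_j|x_{pa(\underline{x}_j)}) , \qquad (8)$$

then Eqs. (7) imply

$$\sum_{x_{\cdot}} |A(x_{\cdot})|^2 = 1 . \qquad (9)$$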

Given any transition matrix of the form $A(r|c)$, call $r$ the row index and $c$ the
column indices. Call all $A(r|c)$ entries with any $r$ but fixed $c$, a column vector of
the transition matrix. Eqs. (7) say that each column vector of a transition matrix is
normalized. The column vectors may also be mutually orthogonal, in which case we
say that the column vectors are orthonormal. For example, the transition matrix for
node $\underline{a}$ in the QB net above might also satisfy:

$$\sum_a \left[\, A_{\underline{a}|\underline{b},\underline{c}}(a|b,c) \,\right] \left[ \begin{array}{c} \text{h.c.} \\ b, c \to b', c' \end{array} \right] = \delta_b^{b'} \delta_c^{c'} \qquad (10)$$

The isometry nodes defined below are another example of a case where the column
vectors of the transition matrix are orthonormal. In general, it is not necessary
that the column vectors be orthonormal. For example, for the marginalizer nodes
defined below, they aren't. What is always necessary is that the total amplitude be
normalized, as in Eq. (6), so as to enforce the "unitarity" of quantum mechanics.

The meta state of a QB net was first defined in Ref. A meta ket state is
a pure quantum state represented as a ket or as a QB net. For example:

$$|\psi_{meta}\rangle_{\underline{a},\underline{b},\underline{c}} = \sum_{a,b,c} A(a,b,c)\, |a\rangle_{\underline{a}}\, |b\rangle_{\underline{b}}\, |c\rangle_{\underline{c}} \qquad (11)$$

where $A(a,b,c)$ is defined by Eq. (5). We assume the states $\{|a\rangle_{\underline{a}}\}_{\forall a}$ are orthonormal,
and likewise for $\{|b\rangle_{\underline{b}}\}_{\forall b}$ and $\{|c\rangle_{\underline{c}}\}_{\forall c}$. Note that each node of the QB net has its own
ket and its own index that is summed over (i.e., bound). For example, node $\underline{b}$ has ket
$|b\rangle_{\underline{b}}$ and bound index $b$.

The projection operator of a meta ket defines a density matrix which we will
call the meta density matrix of the protocol under consideration. For example,
the meta density matrix of the above meta ket is given by:

$$(\rho_{meta})_{\underline{a},\underline{b},\underline{c}} = \left[\, |\psi_{meta}\rangle_{\underline{a},\underline{b},\underline{c}} \,\right] \left[\, \text{h.c.} \,\right] \qquad (12a)$$

$$= \left[\, \text{QB net diagram} \,\right] \left[\, \text{h.c.} \,\right] . \qquad (12b)$$



5 Generic Nodes 



We find it convenient to define certain special, generic types of nodes. 

Marginalizer nodes were first defined in Ref.[8]. In the current version 
of Quantum Fog, marginalizer nodes are usually denoted by black bullets, whereas 
non-marginalizer nodes are denoted by larger colored circles. In this paper, we will 
represent marginalizer nodes by writing a small delta near them. This node "decoration"
or subscript is easy to draw by hand and also easy for the eye to spot. For
example: 






(13) 



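where $\underline{a}_{(1)}$ and $\underline{a}_{(2)}$ have the same state space, call it $S_{\underline{a}}$. Likewise, $\underline{b}_{(1)}$ and $\underline{b}_{(2)}$ have
the same state space, call it $S_{\underline{b}}$. Note that the subscripts (1) and (2) are acting like
a "time" index along a sort of timeline. For all $a, a' \in S_{\underline{a}}$ and $b, b' \in S_{\underline{b}}$,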



^b,..\an„bnMa',b') 



(14) 

2(2>e(l>.2(l>'""' - " ' "' 

Thus some column vectors of a marginalizer node are equal to each other. To avoid
index clutter, we will sometimes omit the indices (1) and (2) from the graph of
Eq. (13), and draw instead the following graph:

(15)



Grounded nodes are root nodes (i.e., nodes with no incoming arrows, only
outgoing ones) which have a deterministic probability amplitude (i.e., an amplitude
that equals 1 for just one of the possible states of the node and zero for all the other
states). Grounded nodes will be indicated by writing a zero near them. Here is an
example of a QB net with a grounded node:




(16) 



where^3

$$A(b) = \delta_b^0 , \qquad (17)$$

for all $b \in S_{\underline{b}}$.

^3 We are assuming that $0 \in S_{\underline{b}}$. It doesn't matter if the amplitude $A(b)$ of node $\underline{b}$ equals $\delta_b^0$, or
$\delta_b^{b_0}$ where $b_0 \in S_{\underline{b}}$. Either way, it's still a grounded node. An alternative to writing a zero next
to a node to indicate that it's grounded might be writing instead the letters "grd" or the electrical
symbol for a ground.

6 Isometries

Consider the following QB net

[QB net diagram] (18)

where $A(b|a)$ satisfies

$$\sum_b \left[\, A(b|a) \,\right] \left[ \begin{array}{c} \text{h.c.} \\ a \to a' \end{array} \right] = \delta_a^{a'} \qquad (19)$$

for all $a, a' \in S_{\underline{a}}$. The node $\underline{b}$ in the above QB net is called an isometry node, or
just an isometry for simplicity.

Eq. (19) is saying that the column vectors of the transition matrix $A(b|a)$ are
orthonormal. This is only possible if $N_{\underline{b}} \ge N_{\underline{a}}$. If $N_{\underline{b}} = N_{\underline{a}}$ (i.e., transition matrix is
square), then the transition matrix is unitary. If $N_{\underline{b}} > N_{\underline{a}}$ (i.e., transition matrix is
rectangular with more rows than columns), then we can use the well-known, so-called
Gram-Schmidt procedure to add more columns to the transition matrix ("extend it")
to produce a unitary matrix.

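Since $N_{\underline{b}} \ge N_{\underline{a}}$ and the sets $S_{\underline{b}}$, $S_{\underline{a}}$ are finite, we may assume without loss of
generality that $S_{\underline{b}} \supset S_{\underline{a}}$. For every $b, A \in S_{\underline{b}}$, let^4

$$A(b|A) = \begin{cases} A(b|a) & \text{if } A = a \in S_{\underline{a}} \\ \text{given by Gram-Schmidt procedure} & \text{if } A \in S_{\underline{b}} - S_{\underline{a}} \end{cases} \qquad (20)$$

Then

$$\sum_{b \in S_{\underline{b}}} \left[\, A(b|A) \,\right] \left[ \begin{array}{c} \text{h.c.} \\ A \to A' \end{array} \right] = \delta_A^{A'} \qquad (21)$$

for all $A, A' \in S_{\underline{b}}$. Thus,

$$A(b|A) = \langle b|\, U_{\underline{b}}\, |A\rangle \qquad (22)$$

where $U_{\underline{b}}$ is unitary.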
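The extension of Eq. (20) can be sketched numerically as follows. This is only a minimal illustration, assuming the isometry is stored as a list of its column vectors and that the extra columns are produced by applying Gram-Schmidt to the standard basis vectors (one of many valid choices). The names `inner`, `extend_to_unitary` and `is_orthonormal` are hypothetical helpers introduced here.

```python
import math


def inner(u, v):
    """Inner product <u|v>, conjugate-linear in the first argument."""
    return sum(x.conjugate() * y for x, y in zip(u, v))


def extend_to_unitary(columns, tol=1e-10):
    """Extend orthonormal columns of length n_b to n_b orthonormal columns.

    The given columns are kept as the first columns of the result; the
    remaining ones come from Gram-Schmidt applied to standard basis vectors.
    """
    n_b = len(columns[0])
    basis = [list(col) for col in columns]
    for k in range(n_b):
        if len(basis) == n_b:
            break
        v = [1.0 + 0j if i == k else 0j for i in range(n_b)]
        for u in basis:
            coeff = inner(u, v)
            v = [vi - coeff * ui for vi, ui in zip(v, u)]
        norm = math.sqrt(inner(v, v).real)
        if norm > tol:  # skip basis vectors already in the span
            basis.append([vi / norm for vi in v])
    return basis


def is_orthonormal(cols, tol=1e-10):
    """Check <col_i|col_j> = delta_ij for all pairs of columns."""
    return all(abs(inner(u, v) - (1.0 if i == j else 0.0)) < tol
               for i, u in enumerate(cols) for j, v in enumerate(cols))


# Example isometry with N_a = 2 columns living in an N_b = 3 dimensional space.
s = 1 / math.sqrt(2)
isometry = [[s, 1j * s, 0j], [0j, 0j, 1 + 0j]]
unitary_cols = extend_to_unitary(isometry)
print(is_orthonormal(unitary_cols))
```

Since a square matrix with orthonormal columns is unitary, `unitary_cols` plays the role of the columns $A(b|A)$ of $U_{\underline{b}}$ in Eq. (22).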

Here is a pictorial representation, in terms of QB nets, of the procedure just 
outlined for extending an isometry to a unitary matrix: 



(23) 



where $S_{\underline{b}} = S_{\underline{A}} \supset S_{\underline{a}}$.

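Consider the following QB net

[QB net diagram] (24)

where $A(a,b|a')$ satisfies

$$\sum_{a \in S_{\underline{a}}} \sum_{b \in S_{\underline{b}}} \left[\, A(a,b|a') \,\right] \left[ \begin{array}{c} \text{h.c.} \\ a' \to a'' \end{array} \right] = \delta_{a'}^{a''} \qquad (25)$$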



for all a', a" G Sa. The node (a, b) in the above QB net is a special case of the general 
isometry node presented previously. Just as in the case of a general isometry, the 



^4 We are using the symbol $A$ both for an element $A$ of $S_{\underline{b}}$, and for amplitudes $A(\cdot)$. It's easy to
tell which usage is intended in each instance, so this should cause no confusion.

transition matrix $A(a,b|a')$ can be extended to a unitary matrix. Assume $0 \in S_{\underline{b}}$. If
we define

$$A(a,b|a',b'=0) = A(a,b|a') , \qquad (26)$$

then we can find a unitary matrix $U_{\underline{a},\underline{b}}$ such that, for all $a, a' \in S_{\underline{a}}$ and $b, b' \in S_{\underline{b}}$,

$$A(a,b|a',b') = \langle a|_{\underline{a}} \langle b|_{\underline{b}}\; U_{\underline{a},\underline{b}}\; |a'\rangle_{\underline{a}} |b'\rangle_{\underline{b}} . \qquad (27)$$

Here is a pictorial representation, in terms of QB nets, of the procedure just 
outlined for extending an isometry to a unitary matrix: 




7 Freeing a Bound Index 

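We've defined the QB net corresponding to the meta ket state as having for each node
$\underline{b}$: (1) an index $b$ that is summed over (bound), and (2) a ket $|b\rangle_{\underline{b}}$. One can free an
index $b$ of a node $\underline{b}$ of a meta ket state by multiplying that node by $\langle b|_{\underline{b}}$ or $|b\rangle_{\underline{b}}\langle b|_{\underline{b}}$. One
can use QB nets to represent these two operations. For example, if the meta state is,

$$\left[\, \text{QB net diagram} \,\right] = \sum_{a,b} A(b|a) A(a)\, |b\rangle_{\underline{b}} |a\rangle_{\underline{a}} , \qquad (29)$$

then

$$\left[\, \text{QB net diagram with index } b \text{ freed by } \langle b|_{\underline{b}} \,\right] = \sum_a A(b|a) A(a)\, |a\rangle_{\underline{a}} , \qquad (30)$$

and

$$\left[\, \text{QB net diagram with index } b \text{ freed by } |b\rangle_{\underline{b}}\langle b|_{\underline{b}} \,\right] = |b\rangle_{\underline{b}} \sum_a A(b|a) A(a)\, |a\rangle_{\underline{a}} . \qquad (31)$$

8 Classical Communication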



Quantum information theorists call "classical communication" the act of measuring 
an observable at one event and then using the result of that measurement to start a 
new event. Classical communication can be represented using QB nets. For example, 
if Sc = Sb, then 

= 5^ Ad|c(rf|f')^c(&)A|a(&|a)A,(a)M)rf|a), . (32) 

d,a 

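9 (Coherent or Incoherent)-(Scalar or Vector) Sums

For any $\rho_{\underline{b},\underline{a}} \in dm(\mathcal{H}_{\underline{b},\underline{a}})$, define

$$\mathrm{tr}_{\underline{b}}(\rho_{\underline{b},\underline{a}}) = \rho_{\underline{a}} = \sum_b \left[\, \langle b|_{\underline{b}} \,\right] \rho_{\underline{b},\underline{a}} \left[\, \text{h.c.} \,\right] , \qquad (33)$$

$$\mathrm{cl}_{\underline{b}}(\rho_{\underline{b},\underline{a}}) = \rho_{\underline{b}_{cl},\underline{a}} = \sum_b \left[\, |b\rangle_{\underline{b}} \langle b|_{\underline{b}} \,\right] \rho_{\underline{b},\underline{a}} \left[\, \text{h.c.} \,\right] , \qquad (34)$$

$$\mathrm{sl}_{\underline{b}}(\rho_{\underline{b},\underline{a}}) = \rho_{\not{\underline{b}},\underline{a}} = \left[\, \sum_b \langle b|_{\underline{b}} \,\right] \rho_{\underline{b},\underline{a}} \left[\, \text{h.c.} \,\right] . \qquad (35)$$

Note that

$$\mathrm{tr}_{\underline{b}}(1) = N_{\underline{b}} , \quad \mathrm{cl}_{\underline{b}}(1) = 1 , \quad \mathrm{sl}_{\underline{b}}(1) = N_{\underline{b}} . \qquad (36)$$

Note also that the product of any two operators in the set $F = \{1, \mathrm{tr}_{\underline{b}}, \mathrm{cl}_{\underline{b}}, \mathrm{sl}_{\underline{b}}\}$ can be
expressed in terms of a single one of them. For example,

$$\mathrm{tr}_{\underline{b}}\, \mathrm{cl}_{\underline{b}}(\rho_{\underline{b},\underline{a}}) = \mathrm{tr}_{\underline{b}}(\rho_{\underline{b},\underline{a}}) , \quad \mathrm{tr}_{\underline{b}}\, \mathrm{tr}_{\underline{b}}(\rho_{\underline{b},\underline{a}}) = N_{\underline{b}}\, \mathrm{tr}_{\underline{b}}(\rho_{\underline{b},\underline{a}}) , \qquad (37)$$

etc. Hence, a product of any number of the operators in $F$ can be expressed in terms
of a single one of them.^5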

For each node $\underline{b}$ of a meta density matrix, there is an index $b$ that is summed
over and a ket $|b\rangle_{\underline{b}}$. Furthermore, the $\sum_b$ is inside the sandbox, so we say that it's
a coherent sum. Because the term being summed (i.e., the summand) includes the
ket $|b\rangle_{\underline{b}}$, we say it's a vector sum.

If the $\sum_b$ were outside the sandbox (and index $b$ appeared in both the sandbox
and its dual), we would call it an incoherent sum. If the summand did not include
$|b\rangle_{\underline{b}}$, we would call it a scalar sum. The operators $\mathrm{tr}_{\underline{b}}(\cdot)$, $\mathrm{cl}_{\underline{b}}(\cdot)$, $\mathrm{sl}_{\underline{b}}(\cdot)$ act on the meta
^5 More formally, if we define $c = \mathrm{cl}_{\underline{b}}$, $\sigma = \mathrm{sl}_{\underline{b}}/N_{\underline{b}}$, $\tau = \mathrm{tr}_{\underline{b}}/N_{\underline{b}}$, and $F = \{1, c, \sigma, \tau\}$, then it is easy to
check that for all $f \in F$, $f\tau = \tau$, $f\sigma = \sigma$, and $fc = c$ if $f \in \{1, c\}$, $fc = \tau$ otherwise. Although $F$ is closed under
composition, it is not a group. This is not surprising since $c$, $\tau$, $\sigma$ are irreversible transformations.

density matrix to change the coherent-vector sum over a node $\underline{b}$ to an incoherent-scalar,
or an incoherent-vector, or a coherent-scalar sum. Let's illustrate this with an
example.

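• Consider the following meta density matrix as an example. Note that in this
meta density matrix, for the random variable $\underline{b}$, there is a coherent-vector
sum over the index $b$.

$$\rho_{\underline{b},\underline{a}} = \left[\, \sum_{a,b} A(b|a) A(a)\, |b\rangle_{\underline{b}} |a\rangle_{\underline{a}} \,\right] \left[\, \text{h.c.} \,\right] \qquad (38a)$$

$$= \left[\, \text{QB net diagram} \,\right] \left[\, \text{h.c.} \,\right] \qquad (38b)$$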



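• "Tracing" (i.e., taking a partial trace of) the random variable $\underline{b}$ means doing an
incoherent-scalar sum over the index $b$.

$$\mathrm{tr}_{\underline{b}}(\rho_{\underline{b},\underline{a}}) = \rho_{\underline{a}} = \sum_b \left[\, \sum_a A(b|a) A(a)\, |a\rangle_{\underline{a}} \,\right] \left[\, \text{h.c.} \,\right] \qquad (39a)$$

$$= \left[\, \text{QB net diagram} \,\right] \left[\, \text{h.c.} \,\right] \qquad (39b)$$

$$= \left[\, \text{QB net diagram} \,\right] \left[\, \text{h.c.} \,\right] \qquad (39c)$$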



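• "Classicizing", or "Making classical" the random variable $\underline{b}$ means doing an
incoherent-vector sum over the index $b$. (This operation is also sometimes described
as "dephasing" because we are throwing away some off-diagonal terms.)

$$\mathrm{cl}_{\underline{b}}(\rho_{\underline{b},\underline{a}}) = \rho_{\underline{b}_{cl},\underline{a}} = \sum_b \left[\, \sum_a A(b|a) A(a)\, |b\rangle_{\underline{b}} |a\rangle_{\underline{a}} \,\right] \left[\, \text{h.c.} \,\right] \qquad (40a)$$

$$= \left[\, \text{QB net diagram} \,\right] \left[\, \text{h.c.} \,\right] \qquad (40b)$$

$$= \left[\, \text{QB net diagram} \,\right] \left[\, \text{h.c.} \,\right] \qquad (40c)$$

• "Slashing" the random variable $\underline{b}$ means doing a coherent-scalar sum over
the index $b$.

$$\mathrm{sl}_{\underline{b}}(\rho_{\underline{b},\underline{a}}) = \rho_{\not{\underline{b}},\underline{a}} = \left[\, \sum_{a,b} A(b|a) A(a)\, |a\rangle_{\underline{a}} \,\right] \left[\, \text{h.c.} \,\right] \qquad (41a)$$

$$= \left[\, \text{QB net diagram} \,\right] \left[\, \text{h.c.} \,\right] \qquad (41b)$$

$$= \left[\, \text{QB net diagram} \,\right] \left[\, \text{h.c.} \,\right] \qquad (41c)$$

Note that the operators in $F$ are all irreversible transformations (except for
the 1). The meta state is truly "at the top of the food chain": Once the operators in
$F$ take the meta state to something else, no operator or combination of operators in
$F$ can bring back the same meta state.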
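The three operations of this section can be tried out on a small example. The sketch below assumes a two-node net $\underline{a} \to \underline{b}$ with a density matrix stored as a dictionary keyed by `(b, a, b2, a2)`, where `b2, a2` stand for the primed indices of the dual sandbox. The storage format and the function names `meta_density`, `tr_b`, `cl_b`, `sl_b` and `close` are choices made for this illustration, not notation from the text.

```python
import math


def meta_density(A_b_given_a, A_a):
    """rho[(b, a, b2, a2)] for the meta ket sum_{a,b} A(b|a) A(a) |b>|a>."""
    n_b, n_a = len(A_b_given_a), len(A_a)
    psi = {(b, a): A_b_given_a[b][a] * A_a[a]
           for b in range(n_b) for a in range(n_a)}
    return {(b, a, b2, a2): psi[(b, a)] * psi[(b2, a2)].conjugate()
            for (b, a) in psi for (b2, a2) in psi}


def tr_b(rho):
    """Incoherent-scalar sum: keep b == b2, sum over b, drop the b index."""
    out = {}
    for (b, a, b2, a2), val in rho.items():
        if b == b2:
            out[(a, a2)] = out.get((a, a2), 0j) + val
    return out


def cl_b(rho):
    """Incoherent-vector sum: zero out the terms that are off-diagonal in b."""
    return {k: (v if k[0] == k[2] else 0j) for k, v in rho.items()}


def sl_b(rho):
    """Coherent-scalar sum: sum over b and b2 independently."""
    out = {}
    for (b, a, b2, a2), val in rho.items():
        out[(a, a2)] = out.get((a, a2), 0j) + val
    return out


def close(x, y, tol=1e-12):
    """True if two dictionaries have the same keys and nearly equal values."""
    return x.keys() == y.keys() and all(abs(x[k] - y[k]) < tol for k in x)


s = 1 / math.sqrt(2)
A_a = [s, s]                       # A(a)
A_b_given_a = [[s, s], [s, -s]]    # A_b_given_a[b][a] = A(b|a)
rho = meta_density(A_b_given_a, A_a)
rho_a = tr_b(rho)

print(close(tr_b(cl_b(rho)), rho_a))   # first identity of Eq. (37)
print(close(sl_b(cl_b(rho)), rho_a))   # slashing after classicizing gives the trace
```

In this example tracing and slashing give different results for the same $\rho_{\underline{b},\underline{a}}$, which shows that the coherent and incoherent sums really are distinct operations.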



10 Ensembles, Purification 



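An ensemble is a set $\{\sqrt{w_j}\, |\psi_j\rangle_{\underline{x}}\}_{\forall j}$ where the weights $w_j$ are non-negative numbers
that sum to 1, and for all $j$, the states $|\psi_j\rangle_{\underline{x}} \in \mathcal{H}_{\underline{x}}$ are normalized but they are not
necessarily mutually orthogonal. The density matrix for this ensemble is

$$\rho_{\underline{x}} = \sum_j w_j \left[\, |\psi_j\rangle_{\underline{x}} \,\right] \left[\, \text{h.c.} \,\right] . \qquad (42)$$

Define two ensembles as being equivalent if they have the same density matrix. This
defines an equivalence relation. Elements of the same equivalence class are physically
indistinguishable.

The density matrix Eq. (42) can be purified, meaning that it can be expressed
as a partial trace of a pure state. One way of doing this is as follows. Clearly, $\rho_{\underline{x}}$ also
equals

$$\rho_{\underline{x}} = \mathrm{tr}_{\underline{j}} \left[\, \sum_j \sqrt{w_j}\, |\psi_j\rangle_{\underline{x}} |j\rangle_{\underline{j}} \,\right] \left[\, \text{h.c.} \,\right] . \qquad (43)$$

Thus

$$\rho_{\underline{x}} = \mathrm{tr}_{\underline{j}} \left[\, |\psi\rangle_{\underline{x},\underline{j}} \,\right] \left[\, \text{h.c.} \,\right] , \qquad (44)$$

where

$$|\psi\rangle_{\underline{x},\underline{j}} = \sum_{x,j} A(x,j)\, |x\rangle_{\underline{x}} |j\rangle_{\underline{j}} , \qquad (45)$$

where

$$A(x,j) = A(x|j) A(j) , \qquad (46)$$

and

$$A(x|j) = \langle x|\psi_j\rangle , \quad A(j) = \sqrt{w_j} . \qquad (47)$$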

11 Measurement Superoperators 

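A superoperator is a linear operator that maps $dm(\mathcal{H}_{\underline{a}})$ into $dm(\mathcal{H}_{\underline{b}})$.

A measurement is defined as a set $\{K_\mu | \mu \in S_{\underline{\mu}}\}$ of operators called
Kraus operators that map states in $\mathcal{H}_{\underline{a}}$ to states in $\mathcal{H}_{\underline{b}}$. We assume $N_{\underline{a}} \le N_{\underline{b}}$.
($N_{\underline{a}} = |S_{\underline{a}}| = \dim(\mathcal{H}_{\underline{a}})$ and the same for $\underline{b}$.) The Kraus operators must also satisfy:

$$\sum_\mu K_\mu^\dagger K_\mu = 1 . \qquad (48)$$

Each Kraus operator can be used to define a superoperator $\Phi_\mu(\cdot)$ as follows.
Let $\rho_{\underline{a}} \in dm(\mathcal{H}_{\underline{a}})$, and, for each $\mu$, let $\rho_{\underline{b}|\mu} \in dm(\mathcal{H}_{\underline{b}})$. Then define the measurement
superoperator $\Phi_\mu(\cdot)$ by

$$\Phi_\mu(\rho_{\underline{a}}) = \rho_{\underline{b}|\mu} = \frac{K_\mu \rho_{\underline{a}} K_\mu^\dagger}{P(\mu)} , \qquad (49)$$

where

$$P(\mu) = \mathrm{tr}_{\underline{b}}(K_\mu \rho_{\underline{a}} K_\mu^\dagger) = \mathrm{tr}_{\underline{a}}(K_\mu^\dagger K_\mu \rho_{\underline{a}}) . \qquad (50)$$

Note that the $P(\mu)$ are non-negative and

$$\sum_\mu P(\mu) = 1 . \qquad (51)$$

Note also that

$$\mathrm{tr}_{\underline{b}}(\rho_{\underline{b}|\mu}) = 1 \qquad (52)$$

for all $\mu$.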
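Eqs. (48) to (52) can be checked on a concrete measurement. The sketch below uses the two Kraus operators of an amplitude-damping process with damping parameter `GAMMA`; this particular measurement, the input state, and the helper names (`dagger`, `matmul`, `trace`, `completeness`, `measure`) are assumptions chosen for illustration and do not appear in the text.

```python
import math


def dagger(m):
    """Hermitian conjugate of a matrix stored as a list of rows."""
    return [[complex(m[r][c]).conjugate() for r in range(len(m))]
            for c in range(len(m[0]))]


def matmul(x, y):
    """Matrix product of two matrices stored as lists of rows."""
    return [[sum(x[i][k] * y[k][j] for k in range(len(y)))
             for j in range(len(y[0]))] for i in range(len(x))]


def trace(m):
    return sum(m[i][i] for i in range(len(m)))


def completeness(kraus_ops):
    """Left-hand side of Eq. (48): sum_mu K_mu^dagger K_mu."""
    n = len(kraus_ops[0][0])
    total = [[0j] * n for _ in range(n)]
    for k in kraus_ops:
        term = matmul(dagger(k), k)
        total = [[total[i][j] + term[i][j] for j in range(n)] for i in range(n)]
    return total


def measure(kraus_ops, rho):
    """Return (P(mu), rho_{b|mu}) for each mu, following Eqs. (49)-(50)."""
    results = []
    for k in kraus_ops:
        unnormalized = matmul(matmul(k, rho), dagger(k))
        p = trace(unnormalized).real
        post = [[z / p for z in row] for row in unnormalized] if p > 0 else None
        results.append((p, post))
    return results


GAMMA = 0.3                                        # illustrative damping parameter
K0 = [[1.0, 0.0], [0.0, math.sqrt(1 - GAMMA)]]
K1 = [[0.0, math.sqrt(GAMMA)], [0.0, 0.0]]
rho_plus = [[0.5, 0.5], [0.5, 0.5]]                # density matrix of (|0> + |1>)/sqrt(2)

identity_check = completeness([K0, K1])
outcomes = measure([K0, K1], rho_plus)
for p, post in outcomes:
    print(p, post)
```

Each post-measurement matrix has unit trace by construction, and the probabilities sum to 1 precisely because the Kraus operators satisfy Eq. (48).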

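A von Neumann measurement (for instance, $K_\mu = |\mu\rangle\langle\mu|$) is a measurement
$\{K_\mu\}_{\forall\mu}$ that satisfies:

$$K_\mu^\dagger = K_\mu , \quad K_\mu K_{\mu'} = K_\mu \delta_\mu^{\mu'} , \quad \sum_\mu K_\mu = 1 . \qquad (53)$$

Here are some other examples of measurements (you can check that $\sum_a K_a^\dagger K_a = 1$
for each example)

• Tracing: $K_a = \langle a|_{\underline{a}}$

• Making a node classical (i.e., dephasing it): $K_a = |a\rangle_{\underline{a}} \langle a|_{\underline{a}}$

• Classical (incoherent) communication: $K_a = |a\rangle_{\underline{b}} \langle a|_{\underline{a}}$, where $S_{\underline{a}} = S_{\underline{b}}$.

• Coherent communication (only one Kraus operator): $K = \sum_a |a\rangle_{\underline{b}} \langle a|_{\underline{a}}$, where
$S_{\underline{a}} = S_{\underline{b}}$.

A measurement $\{K_\mu\}_{\forall\mu}$ can be extended to a unitary operator as follows. For
every $b \in S_{\underline{b}}$, $\mu \in S_{\underline{\mu}}$, $a \in S_{\underline{a}}$, define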



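$$A(b,\mu|a) = \langle b| K_\mu |a\rangle . \qquad (54)$$

Since for all $a, a' \in S_{\underline{a}}$,

$$\sum_{b,\mu} \left[\, A(b,\mu|a) \,\right] \left[ \begin{array}{c} \text{h.c.} \\ a \to a' \end{array} \right] = \sum_{b,\mu} \langle a'| K_\mu^\dagger |b\rangle \langle b| K_\mu |a\rangle = \delta_a^{a'} , \qquad (55)$$

$A(b,\mu|a)$ defines an isometry. Assume $0 \in S_{\underline{\mu}}$. Let

$$A(b,\mu|a,\mu'=0) = A(b,\mu|a) . \qquad (56)$$

Since $N_{\underline{a}} \le N_{\underline{b}}$, we can use Gram-Schmidt to find a unitary operator $U_{\underline{b},\underline{\mu}}$ such that

$$A(b,\mu|A,\mu') = \langle b|_{\underline{b}} \langle \mu|_{\underline{\mu}}\; U_{\underline{b},\underline{\mu}}\; |A\rangle_{\underline{b}} |\mu'\rangle_{\underline{\mu}} \qquad (57)$$

for all $b, A \in S_{\underline{b}} \supset S_{\underline{a}}$ and $\mu, \mu' \in S_{\underline{\mu}}$.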

12 RINNO (POVM) 

A POVM, which I prefer to call a RINNO, is a Resolution of the Identity by Non-Negative
Operators. Thus, a RINNO $\{R_\mu\}_{\forall\mu}$ satisfies

$$\sum_\mu R_\mu = 1 , \quad R_\mu \ge 0 . \qquad (58)$$

(Each $R_\mu$ is a square matrix. A square matrix $M$ is said to be non-negative, or said
to satisfy $M \ge 0$, if $v^\dagger M v \ge 0$ for all complex column vectors $v$.)

Suppose $\rho_{\underline{a}} \in dm(\mathcal{H}_{\underline{a}})$, and for each $\mu$, $R_\mu$ maps $\mathcal{H}_{\underline{a}}$ into itself. For each $\mu$,
define

$$P(\mu) = \mathrm{tr}_{\underline{a}}(R_\mu \rho_{\underline{a}}) . \qquad (59)$$

By Eqs. (58), the $P(\mu)$ are non-negative and satisfy $\sum_\mu P(\mu) = 1$.

A RINNO $\{R_\mu\}_{\forall\mu}$ can be constructed from a measurement $\{K_\mu\}_{\forall\mu}$ by setting

$$R_\mu = K_\mu^\dagger K_\mu \qquad (60)$$

for each $\mu$. The definition of a measurement $\{K_\mu\}_{\forall\mu}$ and Eqs. (60) imply Eqs. (58).
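The construction of Eq. (60) can be illustrated with a small unsharp (non-projective) measurement. The two diagonal Kraus operators below, the test vectors, and the helper names (`rinno_from_kraus`, `expectation`, and so on) are assumptions made for this sketch; they are not taken from the text.

```python
import math


def dagger(m):
    """Hermitian conjugate of a matrix stored as a list of rows."""
    return [[complex(m[r][c]).conjugate() for r in range(len(m))]
            for c in range(len(m[0]))]


def matmul(x, y):
    """Matrix product of two matrices stored as lists of rows."""
    return [[sum(x[i][k] * y[k][j] for k in range(len(y)))
             for j in range(len(y[0]))] for i in range(len(x))]


def trace(m):
    return sum(m[i][i] for i in range(len(m)))


def rinno_from_kraus(kraus_ops):
    """Eq. (60): R_mu = K_mu^dagger K_mu for each mu."""
    return [matmul(dagger(k), k) for k in kraus_ops]


def expectation(m, v):
    """v^dagger M v for a column vector v stored as a list."""
    n = len(v)
    return sum(complex(v[i]).conjugate() * m[i][j] * v[j]
               for i in range(n) for j in range(n))


# Illustrative unsharp measurement: the K_mu are diagonal but not projectors.
K0 = [[math.sqrt(0.9), 0.0], [0.0, math.sqrt(0.2)]]
K1 = [[math.sqrt(0.1), 0.0], [0.0, math.sqrt(0.8)]]
rho_plus = [[0.5, 0.5], [0.5, 0.5]]   # density matrix of (|0> + |1>)/sqrt(2)

R = rinno_from_kraus([K0, K1])
resolution = [[R[0][i][j] + R[1][i][j] for j in range(2)] for i in range(2)]
probabilities = [trace(matmul(r, rho_plus)).real for r in R]   # Eq. (59)
test_vectors = [[1.0, 0.0], [1.0, 1j], [0.3 - 0.2j, -1.5]]
expectations = [expectation(r, v).real for r in R for v in test_vectors]
print(probabilities)
```

Non-negativity holds because $v^\dagger K_\mu^\dagger K_\mu v$ is the squared norm of $K_\mu v$; the test vectors only spot-check this, they do not prove it.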
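13 Channel Superoperators

Suppose $\{K_\mu | \mu \in S_{\underline{\mu}}\}$ is a measurement with Kraus operators $K_\mu : \mathcal{H}_{\underline{a}} \to \mathcal{H}_{\underline{b}}$. Let
$\rho_{\underline{a}} \in dm(\mathcal{H}_{\underline{a}})$ and $\sigma_{\underline{b}} \in dm(\mathcal{H}_{\underline{b}})$. Then define the channel superoperator $\Phi(\cdot)$ by^6

$$\sigma_{\underline{b}} = \Phi(\rho_{\underline{a}}) = \sum_\mu K_\mu \rho_{\underline{a}} K_\mu^\dagger . \qquad (61)$$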



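Note that a channel superoperator is a weighted sum of measurement superoperators.
$\rho_{\underline{a}}$ can always be expressed as

$$\rho_{\underline{a}} = \sum_j w_j \left[\, |\psi_j\rangle_{\underline{a}} \,\right] \left[\, \text{h.c.} \,\right] \qquad (62)$$

where the weights $\{w_j\}_{\forall j}$ are non-negative numbers that sum to one, and the states
$\{|\psi_j\rangle_{\underline{a}}\}_{\forall j}$ are all normalized but not necessarily mutually orthogonal. Note that for
all $b, b' \in S_{\underline{b}}$,

$$\langle b| \sigma_{\underline{b}} |b'\rangle = \sum_{\mu,j} \left[\, \langle b| K_\mu |\psi_j\rangle \sqrt{w_j} \,\right] \left[ \begin{array}{c} \text{h.c.} \\ b \to b' \end{array} \right] \qquad (63a)$$

$$= \sum_{\mu,j} \left[\, \langle b|_{\underline{b}} \langle \mu|_{\underline{\mu}}\; U_{\underline{b},\underline{\mu}}\; |\psi_j\rangle_{\underline{b}} |0\rangle_{\underline{\mu}} \sqrt{w_j} \,\right] \left[ \begin{array}{c} \text{h.c.} \\ b \to b' \end{array} \right] \qquad (63b)$$

where, as discussed in Section 11, $U_{\underline{b},\underline{\mu}}$ is a unitary matrix that extends the measurement
$\{K_\mu | \mu \in S_{\underline{\mu}}\}$.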

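Eq. (62) for $\rho_{\underline{a}}$ and Eq. (63b) for $\sigma_{\underline{b}}$ can be represented as follows in terms of
QB nets:

$$\rho_{\underline{a}} = \mathrm{tr}_{\underline{j}} \left[\, \text{QB net diagram} \,\right] \left[\, \text{h.c.} \,\right] , \qquad (64)$$

$$\sigma_{\underline{b}} = \mathrm{tr}_{\underline{\mu},\underline{j}} \left[\, \text{QB net diagram} \,\right] \left[\, \text{h.c.} \,\right] , \qquad (65)$$

^6 Kraus showed that for any superoperator $\Phi(\cdot)$, $\Phi(\cdot)$ is a channel superoperator iff $\Phi(\cdot)$ is
"completely positive".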
where

m = , (66) 

Mj) = , (67) 

A(6,/i|A,/i') = , (69) 

A(6|6',^0 = '^6', (70) 
A(/z|6',/i')=<. (71) 

14 Complementary Channel 

The channel superoperator $\Phi(\cdot)$ given by Eq. (61) can be used to define a complementary
channel superoperator $\Phi'(\cdot)$. If $\Phi(\cdot)$ is generated using a measurement $\{K_\mu\}_{\forall\mu}$,
then we can find a unitary operator $U_{\underline{b},\underline{\mu}}$ such that

4"-= • (72) 

Now define a measurement $\{L_b\}_{\forall b}$ using the same unitary operator $U_{\underline{b},\underline{\mu}}$.



{b\b |0) 



A 



^r-= • (73) 

Then 

$$\sigma_{\underline{b}} = \Phi(\rho_{\underline{a}}) = \sum_\mu K_\mu \rho_{\underline{a}} K_\mu^\dagger \qquad (74)$$

and

We've already shown how density matrices $\rho_{\underline{a}}$ and $\sigma_{\underline{b}} = \Phi(\rho_{\underline{a}})$ can be represented
by QB nets (see Eqs. (64) and (65)). Likewise, density matrices $\rho_{\underline{a}}$ and
$\sigma_{\underline{\mu}} = \Phi'(\rho_{\underline{a}})$ can be represented by QB nets as follows:



in, 



[ h.c. ] 



(76) 




[ h.c. ] 



(77) 
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